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Abstract. Building on earlier work introducing the notion of "mod- 
Gaussian" convergence of sequences of random variables, which arises 
naturally in Random Matrix Theory and number theory, we discuss the 
analogue notion of "mod-Poisson" convergence. We show in particu- 
lar how it occurs naturally in analytic number theory in the classical 
Erdos-Kac Theorem. In fact, this case reveals deep connections and 
analogies with conjectures concerning the distribution of L-functions on 
the critical line, which belong to the mod-Gaussian framework, and with 
analogues over finite fields, where it can be seen as a zero-dimensional 
version of the Katz-Sarnak philosophy in the "large conductor" limit. 



1. Introduction 

In our earlier paper with J. Jacod0 motivated by results from Ran- 
dom Matrix Theory and probabihty, we have introduced the notion of mod- 
Gaussian convergence of a sequence of random variables (Zn)- This occurs 
when the sequence does not (typically) converge in distribution, so the se- 
quence of characteristic functions does not converge pointwise to a limit 
characteristic function, but nevertheless, the characteristic functions decay 
precisely like a suitable Gaussian, i.e., the limits 

lim exp(-m/3jv -Fii27^/2)E(e™^^) (1.1) 

exist, locally uniformly for u G R, for some parameters {Pn^Jn) € R x 
[0,+oo[. 

Besides giving natural and fairly general instances of such behavior in 
probability theory, we investigated arithmetic instances of it. In that re- 
spect, we noticed that the limits (jl.ip can not exist if the random variables 
are integer- valued, since the characteristic functions E(e*"'^^) are then 
27r-periodic, and we discussed briefly the possibility of introducing "mod- 
Poisson convergence", that may be applicable to such situations. Indeed, 
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we noticed that this can be seen to occur in number theory in one approach 
to the famous Erdos-Kac Theorem. 

In the present paper, we look more deeply at mod-Poisson convergence. 
We first recall the definition and give basic facts about mod-Poisson con- 
vergence in Sections [2] and El Sections [4] and [5] consider number-theoretic 
situations related to the Erdos-Kac Theorem. We show that the nature of 
the mod-Poisson convergence parallels closely the structure of conjectures 
for the moments of zeta functions on the critical line. This becomes es- 
pecially clear over finite fields, leading to very precise analogies with the 
Katz-Sarnak philosophy and conjectures. In fact, in Section [6l we prove a 
version of the mod-Poisson convergence for the number of irreducible factors 
of a polynomial in Fq[X], as the degree increases, which is a zero-dimensional 
case of the large conductor limit for L-functions (see Remark 15.11 and The- 
orem 16. 4p . Our proof convincingly explains the probabilistic features of the 
limiting function, involving both local models of primes and large random 
permutations. 

Notation. In number-theoretic contexts, p always refers to a prime num- 
ber, and sums and products over p (with extra conditions) are over primes 
satisfying those conditions. 

For any integer d ^ 1, we denote by &d the symmetric group on d letters 
and by the set of its conjugacy classes. Recall these can be identified 
with partitions of d, where the partition 

n = 1 • ri H \-d-rd, Tj ^ 0, 

corresponds to permutations with ri fixed points, r2 disjoint 2-cycles, 

disjoint d-cycles. For a E we write for its conjugacy class. We 
denote by zu{a) the number of disjoint cycles occurring in a. 

By f <^ g for x €z X, or f = 0{g) for x € X, where X is an arbitrary set 
on which / is defined, we mean synonymously that there exists a constant 
C ^ such that |/(x)| ^ Cg{x) for all x ^ X. The "implied constant" 
refers to any value of C for which this holds. It may depend on the set X, 
which is usually specified explicitly, or clearly determined by the context. 

Acknowledgments. We thank P-0. Dehaye and A.D. Barbour for in- 
teresting discussions related to this paper, and P. Bourgade for pointing out 
a computational mistake in an earlier draft. Thanks also to the referee for 
a careful reading of the manuscript. 

The second author was partially supported by SNF Schweizerischer Na- 
tionalfonds Projekte Nr. 200021 119970/1. 

2. General properties of mod-Poisson convergence 

Recall that a Poisson random variable P\ with parameter A > is one 
taking (almost surely) integer values k ^ with 

P(Pa = k) = ^e~\ 
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Its characteristic function is then given by 

E(e™-f'^) = exp(A(e^" - 1)). 

Definition 2.1. We say that a sequence of random variables (Z^) converges 
in the mod-Poisson sense with parameters Ajv if the fohowing limits 

lim E(e™-f'^^)-^E(e*"^^) = lim exp(A^(l - e™)) E(e^"^^) = 

exist for every u G R, and the convergence is locally uniform. The limiting 
function $ is then continuous and $(0) = 1. 

Example 2.2. (1) The simplest case of mod-Poisson convergence (which jus- 
tifies partly the name) is given by 

Zn = Pxm + Z (2.1) 

where P\j^ is a Poisson variable with parameter Aat, while Z is an arbitrary 
random variable independent of all P\jv • that case, the limiting function 
is the characteristic function E(e™^) of Z. 

(2) Often, and in particular in the cases of interest in the arithmetic 
part of this paper, Zj\f is (almost surely) integer- valued; in that case, its 
characteristic function is 27r-periodic, and it follows that if the convergence 
is locally uniform, then it is in fact uniform for ti G R. However, this is not 
always the shown by examples like (12. ip if the fixed random variable 
Z is not itself integer- valued. 

(3) A.D. Barbour pointed out to us the paper pjj of H-K. Hwang. Hwang 
introduces an analytic assumption [11, (1), p. 451] on the probability gener- 
ating functions of integer- valued random variables {X^), i.e., on the power 
series 

j;P(X^ = n)z" = E(z^^), 

which is very closely related to mod-Poisson convergence. This assumption 
is used as a basis to deduce results on Poisson approximation of the sequence 
(see Proposition 12.51 below for a simple example). Hwang also gives many 
additional examples where his assumption holds. 

If we have mod-Poisson convergence with parameters (Atv) which con- 
verge, then {Z]\f) converges in law. Such a situation arises for instance in 
the so-called Poisson convergence (see, e.g.,j3), p. 188]), which we recall: 

Proposition 2.3. Let (xj^^) be an array of independent random variables, 
identically distributed in each row, according to a Bernoulli distribution with 
parameter x„ ; 

P(xf") = l)=Xn and P(xf"^ = 0) = 1 - x„ for I ^ i n. 

Set Sn = ^ + . . . + Xt^ . Then, 5„ converges in distribution if and only 
if nxn ^ A > 0, when n — > oo. The limit random variable S is a Poisson 
random variable with parameter A. 
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We will state an analogue of Poisson convergence in the mod-Poisson 
setting in the next section, but first we discuss some basic consequences. 
The link with mod-Gaussian convergence in the last part of the next result 
is quite intriguing. 

Proposition 2.4. Let (Zjsf) be a sequence of random variables which con- 
verges in the mod-Poisson sense, with parameters Xn, such that 

lim Xjy = oo. 

Then the following hold: 

(1) The re-scaled variables Z^/Xn converge in probability to 1, that is, 
for any e > 0, 

Zn 



lim P 



Xn 



> e = 0. 



(2) We have the normal convergence 

Zn — Aiv law 



M{0,1), (2.2) 



where M is a standard Gaussian random variable. 
(3) The random variables 

^ . Zn — Aiv 

^N 

converge in the mod-Gaussian sense (jl.ip with parameters (0, ) and uni- 
versal limiting function 

<^>u{t) = exp(-itV6). 

Proof. This is a very standard probabilistic argument, but we give details 
for completeness. 

(1) For G R, we write 

u 

Xn 

(note that s depends on N and s — > when N — > +oo). By the definition 
of mod-Poisson convergence (in particular the uniform convergence with 
respect to u), we have 

lim exp(Ajv(l - e^")) B(e''^^) = $(0) = 1. 

The fact that 

exp(A7v(e^'' - 1)) = exp((is + 0{s'^))Xn), 

yields 

lim E(e'"^^/^^) = e^". 

Consequently, {Zn /Xn) converges in distribution to 1 and hence converges 
in probability since the limiting random variable is constant. 
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(2) For n G R, we now write 



u 

t : 



(note that t depends on and t ^ when N — > +cxd). 

Again, by the definition of mod-Poisson convergence (in particular the 
uniform convergence with respect to n), we have 

hm exp(Aiv(l - e^*)) E(e**^^) = $(0) = 1. (2.3) 

Moreover, we have 

exp(A^(e** - 1)) = exp((it - /2 + 0{t^))\N) 

= exp(inV^- y + (2.4) 

Let 

Tat — Aat 



Yn 



V Aat 
We have then 

E(e*"^^) = exp(-iuyA^) E(e^*^^). (2.5) 

Writing as 

exp(-m\/A^) X exp((e^* - l)AAr) x exp((l - e**)A7v) E(e**^'^), 
we see from (12.31) and (12.41) that this is 



exp(^-y + 0(^^j j (1 + 0(1)) ^exp(^-yj, as N ^ +oo, 

and by Levy's criterion, this concludes the proof. 

Part (3) is a similar straightforward computation, which we leave as an 
enlightening exercise. □ 

In stating the renormalized convergence to a Gaussian variable, there is 
a loss of information, since the "Poisson nature" of the sequence is lost. 
This is illustrated further by the following result which goes some way to- 
wards clarifying the probabilistic nature of mod-Poisson convergence. We 
recall that the Kolmogorov-Smirnov distance between real-valued random 
variables X and Y is defined by 

dKs{X,Y) = sup|P(X < x)-P{Y ^ x)\. 

a;eR 

Proposition 2.5. Let (Zjsf) be a sequence of random variables which are 
a.s. supported on positive integers, and which converges in the mod-Poisson 
sense for some parameters (Atv), such that Ajv oo when N oo. Assume 
further that the characteristic functions E(e™'^^) are of class and the 
convergence holds in C"^ topology. 
Then we have 

lim dKs{ZN,P\i,) = 
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where is a Poisson random variable with parameter A at, and in fact 

dKs{ZN,Px^) ^ W\\ooX-n'\ 

for 1. 



Proof. We recall the following well-known inequality, which is the ad-hoc 
tool (see, e.g. p!5^, p. 186, 5.10.2]): if X and Y are integer-valued random 
variables, then 



dKs{X,Y) ^ 



1 



Efe' 



E(e*"'*^) 



du. 



Let 



From the inequality, we obtain 

dKs{ZN,Px^) ^ \ r |E(e»^-) - Viv(n)|- 

,^n{u) - 1 



iPn{u)- 



u 



du 



Prom our stronger assumption of mod- Poisson convergence with con- 
vergence, we have a uniform bound 

^n{u) - 1 



u 



^ II ^>', 



for N 1, hence since |^'Ar(u)| = exp(AAr(cosu — 1)), we have 

l$'IU 



dKs{ZN,P\i,) ^ 



,Aiv(cos«-l)^^_ 



It is well-known that the precise asymptotic of such an integral gives order 

— 1/2 

of magnitude for Aat — > +oo. To see this quickly, note for instance 

that cosn — 1 ^ —v?/b on [— 7r,7r], hence 



R 



Svr 
Aat 



which gives the result since \/57r/4 ^ 1. 



□ 



Remark 2.6. (1) Hwang [111 Th. 1] gives this and many other variants for 
other measures of approximation, under the assumption of his version of 
mod-Poisson convergence. In another work with A. Barbour, we consider 
various refinements and applications of this type of statement, including with 
approximation involving more general families of discrete random variables 
(see m- 

(2) As a reference for number theorists, note that the existence of renor- 
malized convergence as in (12. 2p for an arbitrary sequence of integer-valued 
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random variables {Zjy), with Ei{Ziy) = Xjy, does not imply that the Kol- 
mogorov distance dKsiZN , Pxj^) converge to 0: indeed, consider 

Zat = Bi + ■ ■ ■ + Bjy 

where the Bi are Bernoulli random variables with P{Bi = 1) = P{Bi = 
0) = ^. Then A^v = and the normalized convergence in law (j2.2p is the 
Central Limit Theorem. However, it is known that, for some constant c > 0, 
we have 

dKs{ZN,Pxr,) >c>0 
for all (see, e.g., |2l Th. 2], for the analogue in total variation distance, 
which in that case is comparable to the Kolmogorov distance \18\ Prop. 1]). 

3. Limit theorems with mod-Poisson behavior 

Now we give an analogue of the Poisson convergence in the mod-Poisson 
framework. 

Proposition 3.1. Let (xn) of positive real numbers with 

^x„ = +oo, ^x^<+oo, (3.1) 

and let {Bn) he a sequence of independent Bernoulli random variables with 

P(S„ = 0) = 1 - P(5„ = 1) = Xn. 

Then 

Ztv = Bi + ■ ■ ■ + Bj\f 
has mod-Poisson convergence with parameters 

Xn = xi-\ \-XN 

and with limiting function given by 

^u) = n (1 + - 1)) exp(x„(l - e*")), 

a uniformly convergent infinite product. 

Proof. This is again a quite simple computation. Indeed, by independence 
of the variables Bn, we have 

N 

exp(A^(l - en) E(e»^'-) = J] exp(x„(l - e^"))(l + x„(e^" - 1)), 

n=l 

and since 

exp(:E„(l - e^"))(l + x„(e^" - 1)) = 1 + Oixl) 

for u € R and n ^ 1 (recall x„ — > 0), it follows from (j3.ip that this product 
converges locally uniformly to ^{u), which completes the proof. □ 
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Remark 3.2. More generally, assume that (X^^) is a triangular array of 
independent random variables taking values in {0, ai, . . . , a^}, such that 

P[4") =ai] =x«; i = l,...,r. 

Assume that for any i, J2n^i^n^ — °° ^^"^ Sn^i(^n'*)^ < Then S'n = 
converges in the mod-Poisson sense with parameter Xn = 

(1) I , (r) 

4. MOD-POISSON CONVERGENCE AND THE ErDOS-KAC THEOREM: A 

FIRST ANALOGY 

In [121 §4.3], we gave the first example of mod-Poisson convergence as 
explaining (through the Central Limit of Proposition [2^ the classical result 
of Erdos and Kac concerning the statistic behavior of the arithmetic function 
function uj{n), the number of (distinct) prime divisors of a positive integer 
n ^ 1: 

1 , r oj(n) — log log N _ , 1 +2 /r, , , , , 

for any real numbers a < b. 
More precisely, with 

cj'(n) = uj(n) — 1, for n ^ 2, 

we showed by a simple application of the Delange-Selberg method (see, 
e.g., [2m II. 5, Theorem 3]) that for any li € R, we have 

hm y e-'W = c^(n), 

and the convergence is uniform, with 

where the Euler product is absolutely and uniformly convergent: this means 
mod-Poisson convergence with parameters Xn = log log A^. By Proposi- 
tion [231 (2), this implies (liT]) FI To illustrate what extra information is con- 
tained in mod-Poisson convergence we make two remarks: first, by putting 
u = TT, for instance, we get 

^ ^ ^ V(logiV)2 

as — > +00 (since l/r(l + e*'^) = 0), which is a statement well-known to be 
equivalent to the Prime Number Theorem. Secondly, more generally, we can 
apply results like Proposition 12.51 (which is easily checked to be applicable 



^ As we observed, this gives essentially the proof of the Erdos-Kac theorem due to 
Renyi and Turan [16]. For another recent simple proof, see [9]. 
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here) to derive Poisson-approximation results for u;(n) which are much more 
precise than the renormahzed Gaussian behavior (see also §4] and |20^ 
§6.1] for the discussion of the classical work of Sathe and Selberg). 

We wish here to bring to light the very interesting, and very complete, 
analogy between the probabilistic structure of this mod-Poisson version of 
the Erdos-Kac Theorem and the mod-Gaussian conjecture for the distribu- 
tion of the values L-functions, taking as basic example the conjecture for 
the distribution of log |C(l/2 + it)\, which follows from the Keating-Snaith 
moment conjectures for the Riemann zeta function (see jl2j . |14j). 

We start with the observation, following from (14. 2p . that the limiting 
function $(ii) in the Erdos-Kac Theorem takes the form of a product ^(u) = 
^i{u)^2{u) with 

1 / 1 \ e'" / p*" 

f(?^- *^(")=ri(i-p) 

We compare this with the Moment Conjecture in the mod-Gaussian form, 
namely, if U is uniformly distributed on [0,T], it is expected that 

lim e''''°si°g^E(e*'^i°glf(^/2+if/)pN ^ . N^^/ X /4_3^ 

for all n G R (locally uniformly) where 
^ , , G(l + m)2 

(G(z) is the Barnes double-gamma function, see e.g. [2H Ch. XII, Misc. Ex. 
48] ), and 

*^(«)-n(-i)-"'{E(^)Vi. (4.5, 

Here also, the limiting function splits as a product of two terms, and 
each appears individually as limit in a distinct mod-Gaussian convergence. 
Indeed, we first have 

^i(n)= lim e«'(iogiV)E(e™i°sl'i^t(i-^^)l'), 

N—*+oo 

where X^r is a Haar-distributed ?7(A^)-valued random variable. Secondly 
(see [I2l 4.1]), we have 

^2(u)= lim e"'('°s('=^^°sAf))E(e™^^) 

where 

Lat = ^ log|l 

for any sequence {Op)p^N of independent random variables, uniformly dis 
tributed on [0, 1]. 



Q^^P 2 
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Remark 4.1. Note in passing that for fixed p, the p-th component of the Euler 
product of C(l/2 + iU), for U uniformly distributed on [0, T], converges in 
law to (1 - e^'^pp-^/'^)-^ as T ^ +oo. 

We now prove that the Euler product $2 (like ^'2) corresponds to mod- 
Poisson convergence for a natural asymptotic probabilistic model of primes, 
and that <I>i (like ^'i) comes from a model of group-theoretic origino 

We start with the Euler product, where the computation was already 
described in [12l §4.3]: we have 

^ Is e™— 1 / In/ 

*^w=„i!?.n 1-;; 1-;; 0+ 



oo-^-^\ pj \ pj \ p — 1 



and by isolating the first term, it follows that 



S/^+oo J- ^ p p 

= lim E(e™^^^')-iE(e^"^^) 

where 

P^y p^y 

fc^l 

as y — > +00, for some real constant k (see, e.g., [lOl §22.8]), and 

Zy = EK (4-6) 

p^y 

is a sum of independent Bernoulli random variables with parameter 1/p: 

p(i?; = i) = -, p(i?; = o) = i--. 

p ^ p 

We note that this is a particular case of Proposition 13.11 and that (as 
expected) the parameters of these Bernoulli laws correspond exactly to the 
"intuitive" probability that an integer n be divisible by p, or equivalently, 
the Bernoulli variable Bp is the limit in law as — > +00 of the random 
variables defined as the indicator of a uniformly chosen integer n ^ N being 
divisible by p; the independence of the B'^ corresponds for instance to the 
formal (algebraic) independence of the divisibility by distinct primes given, 
e.g., by the Chinese Remainder Theorem. 

As in the case of the Riemann zeta function, we also note that the in- 
dependent model fails to capture the truth on the distribution of uj{n), the 



Since a product of two limiting functions for mod-Poisson convergence is clearly 
another such limiting function, we also recover without arithmetic the fact that the limiting 
function arises from mod-Poisson convergence. 
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extent of this failure being measured, in some sense, by the factor <I>i(m). 
Because 



Z'y - log logy law 



AA(0,1), 



Vlog log y 

this discrepancy between the independent model and the arithmetic truth is 
invisible at the level of the normalized convergence in distribution (as it is 
for log |C(l/2 + ii)|, by Selberg's Central Limit Theorem, hiding the Random 
Matrix Model). 

Now we consider the first factor ^i{u) = r(e*" + 1)^"*^. Again, in [12^ 
§4.3], we appealed to the formula 

r(e» + 1) ~ V^^JV^k 
for n G R (see [211 12.11]) to compute 

-i-r / l\l-e'" / / e*" 

„!i?„n(i+j) (i+i) (i+T 

hm exp(AMl-e»))n l + r 1 + ^ 



N^+oo \ kJ V k 

= lim exp(A7v(l-e^"))E(e*"^'^), 

where 

Xn= Yl log(l + A;-^) =log(A^ + l), 
and Zat is the sum 

Zjy = Bi + B2 + • • • + Bjy, 

with B/. denoting independent Bernoulli random variables with distribution 

p(Bk = 1) = 1 ^ = P(Bk = 0) = = — 

^ " ' 1 + i fc + 1' ^ " ' 1 + i k + l 

The group-theoretic interpretation of this distribution is very suggestive: 
indeed, it is the distribution of the random variable w{aN+i) — 1, where 
(Jn+i S ©AT+i is distributed according to the uniform measure on the sym- 
metric group, and we recall that w{a) is the number of cycles of a permu- 
tation. In other words, we have 

as proved, e.g., in [T] §4.6]; note that this is not obvious, and the decom- 
position as a sum of independent random variables is due to Feller, and is 
explained in [U p. 16]. 
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So we see - and this gives another example of natural mod-Poisson con- 
vergence - that these random variables have mod-Poisson convergence with 
parameters logA^, and limiting function l/r(e*"): 

hm exp((logiV)(i_e»))E(e»-('^-)) = -i-. (4.8) 

For further reference, we state a more precise version, which follows 
from (|T71) : 

E(e»-(-iv)) = _l_exp((logiV)(e» - i))(i + o(l)), (4.9) 

locally uniformly for u £ Tl. Note that this includes the special case u = 
{2k + l)7r where 

^ ' r(e™) 

This explanation of the "transcendental" factor l/r(e*"+l) is particularly 
convincing because of well-known and well-studied analogies between the 
cycle structure of random permutations and the factorization of integers 
(see, e.g., the discussion in [H §1.2] or the entertaining survey [5). Its origin 
in [12t 4.3] is, however, not very enlightening: the Gamma function appears 
universally in the Delange-Selberg method in a way which may seem to be 
coincidental and unrelated to any group-theoretic structure (see, e.g., [20^ 
§5.2] where it originates in a representation of l/r(z) as a contour integral 
of Hankel type). 

5. The analogy deepens 

The discussion of the previous section is already interesting, but it be- 
comes (to our mind) even more intriguing after one notes how the analogy 
can be extended by including consideration of function field situations, as 
in the work of Katz-Sarnak |13j . 

Let Fg be a finite field with q = elements, with n ^ 1 and p prime. 
For a polynomial / € [X] , let 

uj{f) = ujq{f) = |{7r E Fg[X] I TT is irreducible monic and divides /}| 

be the analogue of the number of prime factors of an integer (we will usually 
drop the subscript q). 

We consider the statistic behavior of this function under two types of 
limits: (i) either q is replaced by q"^, m — > -|-oo, and / is assumed to range 
over monic polynomials of fixed degree d ^ 1 in Fgm[X]; or (ii) q is fixed, 
and / is assumed to range over monic polynomials of degree d — > -|-oo in 

The first limit, of fixed degree and increasing base field, is similar to the 
one considered by Katz and Sarnak for the distribution of zeros of families of 
L- functions over finite fields [13]. And the parallel is quite precise as far as 
the group-theoretic situation goes. Indeed, recall that the crucial ingredient 
in their work is that the Frobenius automorphism provides in a natural way 
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a "random matrix" for a given L-function, the characteristic polynomial of 
which provides a spectral interpretation of the zeros (see, e.g., [121 §4.2] for 
a partial, down-to-earth, summary). 

In our case, let us assume first that / G Fg[X] is squarefree. Let Kf 
denote the splitting field of /, i.e., the extension field of Fg generated by 
the d roots of /, and let Ff denote the Frobenius automorphism x x'^ 
of Kf. This automorphism permutes the roots of /, which all lie in Kf, 
and after enumerating them, leads to an element of &d, denoted Ff. This 
depends on the enumeration of the roots, but the conjugacy class Fj G 
is well-defined. 

Now, by the very definition, we have 

u;if)=rniFj), (5.1) 

which can be seen as the (very simple) analogue of the spectral interpre- 
tation of an L-function as the characteristic polynomial of the Frobenius 
endomorphism. 

Remark 5.1. We can come even closer to the Katz-Sarnak setting of fam- 
ilies of L-functions. Consider, in scheme-theoretic language^ the (very 
simple!) family of zeta functions of the zero-dimensional schemes Xf = 
Spec(Fg[X]/(/)), i.e., the varieties over Fg with equation f{x) = 0. These 
zeta functions are defined by either of the following two formulas: 

where \Xf \ is the set of closed points of Xf. Since these correspond naturally 
to irreducible factors of / (without multiplicity) , it follows that 

Z{Xf) = ]J(l -T'i^gW)-!, 

and hence, if / is squarefree, a higher-level version of (15. ip is the "spectral 
interpretation" 

Z{Xf) = det(l - FfT\H°,{Xf, Q^))~i = det(l - p{Ff)Ty^ (5.2) 
where Ff is still the Frobenius automorphism, H^{Xf,Q£) is simply iso- 
morphic with Q^*'^^'^^ (the variety over the algebraic closure has deg(/) con- 
nected components, which are points), and p is the natural faithful repre- 
sentation of Sdcg(/) ill ^(deg(/),C) by permutation matrices, since this is 
quite clearly how Ff acts on the etale cohomology space. 

Looking at the order of the pole of Z(Xf) at T = 1, we recover ()5.ip . 
In particular, the generalizations of the Erdos-Kac Theorem that we will 
prove in the next section can be interpreted as describing the limiting sta- 
tistical behavior, in mod-Poisson sense, of the order of the pole of those zeta 



Readers unfamiliar with this language can skip this remark, which will not be used, 
except to state Theorem l6.4l b elow . 
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functions as the degree deg(/) tends to infinity (see Theorem I6.4p . It is 
truly a zero-dimensional version of the Katz-Sarnak problematic for grow- 
ing conductor. (Note that this interpretation also suggests to look at other 
distribution statistics of these zeta functions, and we hope to come back to 
this). 

The relation (j5.ip (or (j5.2p ) explains the existence of a link between the 
number of irreducible factors of polynomials and the number of cycles of per- 
mutations. Indeed, the other essential number-theoretic ingredient for Katz 
and Sarnak is Deligne's Equidistribution Theorem, which shows that the 
matrices given by the Frobenius, in the limit under consideration where q is 
replaced by g™", m — > -|-oo, become equidistributed in a certain monodromy 
group. Here we have, exactly similarly, the following well-known: 

Fact. In the limit of fixed d and m — > -|-oo, for / uniformly chosen among 
monic squarefree polynomials of degree d in Fgm[X], the conjugacy classes 
Pj become uniformly distributed in for the natural (Haar) measure. 

This fact is easily proved from the well-known Gauss-Dedekind formula 

n,(^)= E i = ^EM5)«'^' = 7 + o('/^^') 

dog(7r)=d 5\d 

for the number of irreducible monic polynomials of degree d with coefficients 
in Fq, and it is a "baby" analogue of Deligne's Equidistribution Theorem^ 
Hence, we obtain 

uj[f) w{ad), 

as m ^ +00, where / is distributed uniformly among monic polynomials of 
degree d in Fq™[X], and is distributed uniformly among S^. 

The second limit, where the base field Fg is fixed and the degree d grows, 
is analogue of the problematic situation of families of curves of increasing 
genus over a fixed finite field (see the discussion in [131 P- 12]), and - for our 
purposes - of the distribution of the number of prime divisors of integers, 
which we discussed in the previous section. In the next section, we prove 
a mod-Poisson form of the Erdos-Kac theorem in Fg[X] (the Central Limit 
version being a standard result, essentially due to M. Car, and apparently 
stated first by Flajolet and Soria [U §3, Cor. 1]; see also the recent quick 
derivation by R. Rhoades \T7])- 

Remark 5.2. One may extend the conjugacy class Pj € to all / G Fq[X] 
of degree d, in the following directly combinatorial way (which hides the 
Frobenius aspect): Pj is the conjugacy class of permutations with as many 
disjoint j-cycles, 1 ^ j ^ d, as there are irreducible factors of / of degree 
j. However, the relation a;(/) = w{Pf) does not extend to this case, since 



^ Indeed, it could be proved using the Chebotarev density theorem, which is a special 
case of Deligne's theorem. 
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multiple factors are not counted by ui. However, we have = ro(F'), 

where is the number of irreducible factors counted with multiplicity. 

6. MOD-POISSON CONVERGENCE FOR THE NUMBER OF IRREDUCIBLE 
FACTORS OF A POLYNOMIAL 

In this section, we state and prove the mod-Poisson form of the analogue 
of the Erdos-Kac Theorem for polynomials over finite fields, trying to bring 
to the fore the probabilistic structure suggested in the previous section. 

Theorem 6.1. Let q ^ 1 be a power of a prime p, and let uj{f) denote 
as before the number of monic irreducible polynomials dividing / G Fg [X] . 
Write \g\ = q'^'^^'^a) = \Fq[X]/{g)\ for any non-zero g £ Fg[X]. 
For any u £ R, we have 

lim ^Mil-epiogd) y ,..K/)-i)^$^(,)^^(,)^ (6.1) 
d— ►+00 q"- 

where 
and 



deg(/)=d 



4i(") = (6.2) 



vr / V vr 

7r ' ' ' ' 

the product running over all monic irreducible polynomials vr G Fg[X] and 
the sum over all monic polynomials f G F|j[X] with degree deg(/) = d. 
Moreover, the convergence is uniform. 

Remark 6.2. Note the similarity of the shape of the limiting function with 
that in ()4.2p and the conjecture for C(l/2 + it), in particular the fact that 
the group-theoretic term is the same as for uj{n), while the Euler product is 
a direct transcription in Fg[X] of the earlier <I>2- 

Remark 6.3. This can be rephrased, according to Remark 15. 11 in the follow- 
ing manner which illustrates the analogy with the Katz-Sarnak philosophy: 

Theorem 6.4. Let q ^ 1 be a power of a prime. For any f G Fg[X], monic 
of degree ^ 1, let Xf be the zero- dimensional scheme Spec(Fq[X]/(/)), let 
Z{Xf) G Q(T') denote its zeta function and let r{Xf) ^ denote the order 
of the pole of Z{Xf) at T = 1. Then for any u G R, we have 

exp((l-e-)logci) y 

deg(/)=d 

with notation as before. 

The only thing to note here is that if / is not squarefree, the scheme 
Xf is not reduced; the induced reduced scheme is X^i,, where f^ is the 
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(squarefree) product of the distinct monic irreducible factors dividing /. 



-r{f) = ordT=i Z{Xf) = ordT=i Z{Xf,) = -r{f) = u;{f) = u;{f), 



so the two theorems are indeed equivalent. 

Remark 6.5. One can also prove by the same method the following two vari- 
ants, where we restrict attention to squarefree polynomials, or we consider 
irreducible factors with multiplicity. First, we have 



where the sum runs over all squarefree monic polynomials / G Fg[X] 
with degree deg(/) = d. Next, we have 



We now come to the proof. The idea we want to highlight - the source 
of the splitting of the limiting function in two parts of distinct probabilistic 
origin - is to first separate the irreducible factors of "small" degree and 
those of "large" degree (which is fairly classical), and then observe that an 
equidistribution theorem allows us to perform a transfer of the contribution 
of large factors to the corresponding average over random permutations, 
conditioned to not have small cycle lengths. This will explain the factor 
<I>i corresponding to the cycle length of random permutations. Note that 
shorter arguments are definitely available, using analogues of the Delange- 
Selberg method used in [12] (see [6l §2, Th. 1]), but this hides again the 
mixture of probabilistic models involved. 

Interestingly, the small and larger irreducible factors are not exactly in- 
dependent. But the dependency is (essentially) perfectly compensated by 
the effect of the conditioning at the level of random permutations. Why this 
is so may be the last little mystery in the computation, which is otherwise 
very enlightening. 

We set up some notation first: for / G Fq[X], we let d^{f) (resp. d^{f)) 
denote the largest (resp., smallest) degree of an irreducible factor vr | /; 
correspondingly, for a permutation a G 6d, we denote by i^icr) (resp. i~ (a)) 
the largest (resp. smallest) length of a cycle occurring in the decomposition 
of a. 

Henceforth, by convention, any sum involving polynomials /, g, h, etc, is 
assumed to restrict to monic polynomials, and any sum or product involving 
vr is restricted to monic irreducible polynomials. 

The next lemma summarizes some simple properties, and the important 
equidistribution property we need. 



Then Z{Xf) = Z(Xjb), and we have 
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Lemma 6.6. With notation as above, we have: 

(1) For all d ^ 1, we have 

deg(7r)=(i 

(2) For all d ^ 1, we have 

n (6.4) 

deg(7r)<d 

n 0-r)--p(-e})0+oG))- («-^) 

(3) For any d ^ 1 and any fixed permutation a £ 6^, we have 
1 1 = P(^'^ = ^)(i + 0(-F^))> (6-6) 



deg(/)=rf 

where the conjugacy class Fj G is defined in the previous section, ad is 

b 

a uniformly chosen random permutation in &d and ^ restricts the sum to 
squarefree polynomials. 

In all estimates, the last under the assumption i'^)/'^ ^ d, the implied 
constants are absolute, except that in (|6.4|) . the implied constant may depend 
on q. 

Proof. The first statement has already been recalled. For (|6.4p . we have 

deg(7r)^d ' ' Ifijfid^ 



for d ^ 1, with an implied constant depending on q. 

For ()6.5p . which is the analogue for Fg[r] of the classical Mertens estimate, 
we refer, e.g., to [19], where it is proved in the form 

n 0-r)-?0-< 

dcg{7r)^c( 

for d ^ 1, 7 being the Euler constant; since 

E 7 = >°s<i+7 + o(l), 

we get the stated result. We emphasize the fact that the asymptotic of the 
product in (j6.5p is independent of q (and is the same as for the usual Mertens 
formula for prime numbers), since this may seem surprising at first sight. 
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This is explained by the relation with random permutations, and in fact, in 
Remark 16.101 below, we explain how our argument leads to a much sharper 
estimate (j6.20p for the error term in (16. 5p . 

Finally, for the third statement, if a is a product of rj disjoint j-cycles 
for 1 ^ j ^ d, we first recall the standard formula that 

PK = a)= n -F^' (6.7) 

and we observe that the product can be made to range over i~ [a) ^ j ^ d, 
since the terms j < £~ (cr) have rj = by definition. Using this observation, 
we have by simple counting 



E 1- n 

deg{f)=d ^-(o-)^-^d 



n,(j) 



Furthermore, for any r and j ^ 1 such that r < q^^"^ and j ^ r, we have 

'n,(j)\ _ 1 



r 



-U,{j){U,{j)-l)---{U,{j)-r + l) 



"1 \ ^ J j^/p} 



by the first part of the lemma. Combining the two formulas, we get 

deg(/)=d £-{a)^^d ^ 



£-{a)<ij^d'^^'^ 



e-{a)<ij<id 

and this immediately gives the conclusion since the implied constant in the 
formula for Ilq{j) is at most 1. □ 

Part (3) of this lemma means that, as long as we consider permutations 
a G &(i with no short cycle, so that 

d = 0(g^"M/2), 

there is strong quantitative equidistribution of the conjugacy class Fj among 
all conjugacy classes in S^. 

Thus, to compare the distribution of polynomials and that of permuta- 
tions, it is natural to introduce a parameter 6, ^ 6 ^ d, to be specified 
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later, and to first write any monic polynomial / of degree d as f = gh, where 
the monic polynomials g and h are uniquely determined by 

d+{g)i^b, d-{h)>b (6.8) 

(i.e., g contains the small factors, and h the large ones; they correspond to 
"friable" and "sifted" integers in classical analytic number theory). One can 
expect, by the above, that if b is such that g^/^ is large enough compared 
with d, the distribution of h will reflect that of permutations without cycles 
of length ^ b. And the contribution of small factors should (and will) be 
comparable with the independent model for divisibility of polynomials by 
irreducible ones. 

We now start the proof of Theorem [6TT] along these lines, trying to evaluate 



qd 

deg(/)= 



Writing / = gh, where g and h satisfy (j6.8p as above, we have io{f) 
uj{g) + Lv{h) since g and h are coprime, and hence 

1 •iuuj{g) 

^ Y: ^T(d-deg(<,),6), 

deg(/)=d deg{g)^d 



where we define 



deg(/)=d 
d-(f)>b 



Denote further 



R{d,b)= h s{d,b)= ^ 

\9\ \9\ 

deg{g)>d deg(g)^d 
d+{g)Sib d+{g)^b 

Noting that \T{d,b)\ ^ 1 for all b and d, and splitting the sum over g 
according as to whether deg{g) ^ \/d or deg(g') > Vd, we get 

-1 iuuj(g) 

^ Y -^T{d-deg{g),b)+0{R{Vd,b)) 

Q \Q\ 

dcg(f)=d deg{g)^Vd 
d+{gKb 

= Si + 0{R{Vd, b)), say. (6.9) 



The next step, which is were random permutations will come into play, 
will be to evaluate T{d, b) asymptotically in suitable ranges. 
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Proposition 6.7. With notation as before, we have 

b 

T{d,b) = exp(^-e™^-) E(e^"^('"^))+ 
i=i 

with an absolute implied constant, in the range 

ct''^ ^ d, b^d. (6.11) 

Proof. Before introducting permutations, we separate the contribution of 
squarefree and non-squarefree polynomials in T{d, b) (the intuition being 
that non-squarefree ones should be much sparser than for all polynomials 
because of the imposed divisibility only by large factors): 

T{d,b) = T\d,b)+T\d,b) 

where 

T\d,b) = \ 

deg(/)=d 
d'{f)>b 

and T^{d, b) is the complementary term. We then estimate the latter by 

\TKd,b)\^ E i E 1 

Kdeg(g)<d/2 ^ deg(/)=d 
b^deg(g)^d/2^ deg{/)=d-2deg(g) 



dcg(g)^6 ^ 



1 1 



We can now introduce permutations through the association f Pj 
sending a squarefree polynomial to its associated cycle type. Using (|5.ip . 
we obtain 

o-eSd deg(/)=d 

which is now a sum over permutations without small cycles. Using the third 
statement of Lemma 16.61 we derive 



T\d,b)= Y e™^^"^P(ad = cT)(l + 0' 



b/2 

i~i(T)>b 

E(e--('^'*)l,-(,^)>,) +0(p(r(a,) > b)dq-'/' 
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with an absolute implied constant if g^/^ ^ d. 

Thus the problem is reduced to one about random permutations. Using 
Proposition 16.81 below with e = 1, the proof is finished. □ 

Now recall that the characteristic function E(e™^^'^''^) is explicitly known 
from (14. 7p . This formula, or (14. 9p . implies in particular that we have 

E(e™^(<T<i-,)) ^ E(e™^('"''))(^l + 0(^^)). (6.12) 

Then, inserting the formula of Proposition 16. 71 in the first term Si of (|6.9p . 
and using this formula, we obtain in the range of validity ()6.1ip that 

5i = exp(-e*"^-jE(e^"^("'')) ^ -^-^ + R 

d+{gKb 

where, after some computations, we find that 

< (I E(e^""'("''))|62d^i + ^ (log d)^/^d-^)S{Vd, b), 

with an absolute implied constant. 

Extending the sum in the main term, we get 

Si = M + Ri, 

where 

M = E(e™^('"''))exp(-e™^-j ^ 

i=i ^ d+(g)^b 

Ri « bRiVd^b) + (l E(e-(-))|^ + + ^^^i^^)5(Vd,6). 



Now, we can finally apply (j6.5p and multiplicativity in the sum over g in 
M, to see that 

M = E(e-(-)) n (l-d (l + R^)(^ + ^(6 

dcg(7r)^;) 

and hence, by the mod-Poisson convergence of w{afi) and the absolute con- 
vergence of the Euler product extended to infinity, we have 

lim exp((log(i)(l - e™))M = ^i(n)$2(n), 

d,b^+oo 

uniformly for n G R. 

There remain to consider the error terms to conclude the proof of Theo- 
rem [6TTJ We select b = (logd)^ —>■ +oo; then (j6.1ip holds for all d ^ dQ{q), 
and hence the previous estimates are valid and we must now show that 

exp((logd)(l - e™))i?(\/d,6) ^ 0, exp((logd)(l - e™))i?i ^ 

(the first desideratum coming from (16. 9p ). 



(fR{Vd, (log df) < exp f 2 log d + 2C log log d - ) — ^ 0, 
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Note that | exp((logd)(l - e*"))| ^ d^. Now we claim that 

Rid,b) <^b^e-'^/'' (6.13) 
S{d,b)<^b, (6.14) 

for 1 ^ 6 ^ d and absolute constant C > 0, with absolute implied constants 
for the first, and an implied constant depending only on q for the second. 
Granting this, we have 

Vd 
(1^2 

and all terms in Ri are similarly trivially estimated, except for 

exp((log(i)(l -e^"))|E(e*"^(^''))|52(i-i5(\/d,6) < b^d'^ — > 0, 

using again the mod-Poisson convergence of w{ad)- 

We now justify (j6.14p and (16.13P : for the former, by ()6.4p . we have 

is(\/d,6)K n 

deg(7r)<b 

and for the latter, we need only a simple application of the well-known 
Rankin trick: for any a ^ 0, d ^ 1 and g G Fg[X], we have 

1 J , ^ J < «'^(dGg{9)-'i) 
^Aeg{g)>d ^ S/ ) 

and hence, by multiplicativity, we get 

R{d,b) ^ q-"'^ g^""^^'^"'^^^^ = q^""^ n " 

which we estimate further for a using 

, |„|fe((T-l). 

n (i-w"-'r' = »p( E 

deg(7r)^6 dcg(7r)^6fc^l 



6 



^exp(C7^^) ^ exp(CV''log6) 



.=1 ^ 

for some absolute constants C, C > 0. Taking a = 1/(6 log q) leads imme- 
diately to ([6331) . 

Finally, here is the computation of the characteristic function of the cycle 
count of permutations without small parts that we used in the proof of 
Proposition 16.71 



Proposition 6.8. For all d^ 2 and 6^0 such that b ^ d, we have 

b 



E(e--(-'')l,_(,^)>,) = exp(-e» ^ -) E(e^--('^^)) 

+ 0(|E(e'"^('^'*))|6i+^(i-i + 63(logfi)^/2^-2), (6.15) 
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for any e > 0, where the implied constant depend only on e. 

Proof. This is essentially a sieve (or inclusion-exclusion) argument, which 
may well be already known (although we didn't find it explicitly in our survey 
of the literature). To simplify the notation, we will prove the statement by 
induction on b, although this may not be necessary; taking care of the error 
terms is then slightly more complicated, and readers should probably first 
disregard them to see the main flow of the argument. 
We denote 

$d,b(n) = E(e™-('^'*)l,-(,,)>b), <I>d(n) = '^dA^), h = Y.-. 
We will write 

^d,b = exp(-e*'^/ife)$rf + \^d\Ed,b + Fd,b, (6.16) 

where E^.t, F^^b ^ 0; such an expression holds for 6 = 0, with E^fl = F^.o = 
0, and we will proceed inductively to obtain an expression for ^^.b from 
that of ^d',b-i, d' ^ d, from which we will derive estimates for Ed^j, and 
Fd^b in general. Note that we can assume that d is large enough (i.e., larger 
than any fixed constant), since smaller values of d (and b) are automati- 
cally incorporated by making the right-most implied constant large enough 
in (j6.15|) . Also, we can always write such a formula with \FdM\ b, for some 
absolute constant, since the characteristic functions are bounded by 1 
and exp(— e™/i;,) <^ b. 

Now, with these preliminaries settled, let / be the set of 6-cycles in (3^; 
we write r | a (resp. r f a) to indicate that t G I occurs (resp. does not 
occur) in the decomposition of a in cycles. Then we have 

$,,6(n) = E(e--('^'^)l,-(,^)>,) 

£-(o-)>fe-l £-(<T)>fe-l re/ 

We expand the product as a sum over subsets J C /, and exchange the 
two sums, getting 

' •^c/ e-{cr)>b-l 
r6 J=>t|o- 

Now fix a J C / such that the inner sum is not empty. This implies of 
course that the support of the cycles in J are disjoint, in particular that 
those cycles contribute \J\ to ro(cr). Moreover, if we call A the complement 
of the union of the support of the cycles in J, we have \ A\ = d— | J|6, and any 
a in the inner sum maps A to itself. Thus, by enumerating the elements of 
A, we can map injectively those a to permutations in &d-\j\b, and the image 
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of this map is exactly the set of those cJi G &d-\j\b for which i (o"i) > 6 — 1. 
Moreover, if a maps to ai, we have 

w{a) = I J| + vj{ai), 

and thus we get 



E 



i-(a)>b-l creed_|j|i, 
TeJ=>T|(T £-(o-)>6-l 



and then 



Jc/ 

(d- |J|fe)! ^ |j| 
- 3j ) '^d-\j\h,b~i[u), 

JCI 

the sum over J being imphcitly restricted to those subsets of / for which 
there is at least one permutation in where all cycles in J occur. 

In particular, we have |J| ^ d/b (so there is enough room to find that 
many disjoint 6-cycles), and if we denote by S{k,b) the number of possible 
such subsets of / with \J\ = k, we can write 

'^^'^ (fJ-khV 
<^d,b{u) = Y^S{k,b)^^^i-en'^d-kb,b-i{u) 

k=0 

Now we claim that 



J-kbJ b^k\ {d-kb)\b^k\' 

Indeed, to construct the subsets J with \J\ = k, we can first select arbi- 
trarily a subset A of size d — kb in {1, ... ,d}, and then select, independently, 
an arbitrary set of k disjoint 6-cycles supported outside A. The choice of A 
corresponds to the binomial factor above, and the second factor is clearly 
equal to the number of permutations a S &kb which are a product of k dis- 
joint 6-cycles. Those are all conjugate in 6kb, and their cardinality is given 
by (j6.7p . applied with d replaced by kb and all rj = except for rf, = k. 

Consequently, we obtain the basic induction relation 

^ /-e^"\fc 1 

'^ddu) = 2^ (-^j -<i>d-kb,b-iiu). 

k=0 

Before applying the induction assumption (j6.16p . we shorten the sum 
over k so that ^d-kb,b-i will remain close to ^d,b-i- For this, we use the 
inequality 

mi. ^ 
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for \z\ ^ 1, m ^ 0, as well as \(^d-kb{u)\ ^ 1, and deduce that 

, iu , /In 

^'^.''M = E (-r) ^^-^-^M-iW + (6.17) 

k=0 

for some m to be specified later, subject for the moment only to the condition 
m < d/2b, and an implied constant which is at most 1. 
By (|6.16p . we have 

^d-kb,b-i{u) = exp(-e™/i;,_i)^>d_fcfe(u) + \^d-kb{u)\Ed-.kb,b~i + Fd-kb,b-i- 
Moreover, by (|6.12p . we also know that for k ^ m, we have 

^d-kbiu) = E(e--(-''))(l + 0(^)) = cl>,(n)(l + (6.18) 

with an absolute implied constant. Hence, we obtain 

$rf,,(n) = exp(-e^" Vi)$rf(n)M + R + S 



where 



k=0 



m ^ 

I'^l ^ E J^Fd-kb,b-l\^d-kb{u)\ 
k=0 

k=0 

m ^ 1 

fc=0 

We next write 

^ Vd^ 6*^-1(^-1)!/ Vm! 

where the implied constants are absolute, and deduce that 

$rf,6(ti) = exp(-e^"/i6)$d(n) + \Mu)\Mi + R + S, 

with 



1 

'II < ■ 

ml 

where the implied constant is absolute. The desired shape of the main term 
is now visible, and it remains to verify that (for a suitable m) the other 
terms are bounded as stated in the proposition. 

First, comparing with (j6.16p . with the terms in Ri and R contributing to 
Ed^b, while those in 5 contribute to Fd^b, we see that we have 



m 



^'i'>' ^ E -^\^d-kb,b-l + 
fc=0 
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We now select m = [log d\ . Then, together with Fdfl = 0, we claim that 
this inductive inequality implies 

Fd,b^Cb^ {log d)'^^d~^, (6.19) 

for 6 ^ d and some absolute implied constant C ^ 1. For a large enough 
value of C, note that this is already true for all d ^ do, where do can be any 
fixed integer. We select do so that 

1 ^ 1 
m! ^ d^ ' 

for d ^ do, and we can thus assume that d > do from now on. 

The desired bound holds, of course, for 6 = 0. It is also trivial if b{\og d) ^ 
d/24 (say), because we have observed at the beginning that (I6.16P can be 
obtained with Fd^ <C h. If it is assumed to be true for all d and 6 — 1, we 
have for 6(logd) < d/24 that 

1 1 

k=0 

^ 1 C(logd)V2(6-l)3 1 / fc6x-2 
^ m! d2 ^ bf^kl V d) ' 

k=0 

We note the following simple inequalities 

3x 

(1 - x)"^ ^ e^'', exp(x)^l + y, for 0^x^1/2, 

(x - l)e^/^ ^ X, for ^ X < 1, 

and from them we deduce that if 6(logd) < d/24 (so that kb/d ^ 1/2 for 
the values involved), we have 

1 / A;6\-2 ^ I /exp(46/d)\fc /I 6 

A:=0 fc=0 

and, hence (from the same simple inequalities) we get 

('-i)'Ei^(i-f)"<('-''"^'<('-^'-Ks 

A;=0 

>-l)expQ)'^'^ (6- 1)3/263/2. 

By the choice of do, we deduce for 6^1 and d > do that we have 

Fd,b ^ d-\logd)^/\l + Cb^/\b-lf/^) ^ Cd-%^ 

(assuming again C large enough), completing the verification of (j6.19|) by 
induction. 
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Finally, from (I6.16P and the foregoing, we deduce similarly that 

k=Q 

for some absolute constant D ^ 0. Fix e > 0, and consider the bound 
then if C ^ 1, assuming it for 6 — 1, we obtain the inductive bound 

k=0 

^ d'' [Dil + e'l') + C{b - exp(i + ^^^) } 

(using again the elementary inequalities above). Then for d ^ di{e), pro- 
vided C ^ 1, we obtain 

Ed,b < Cb'+'d-\ 

confirming the validity of this estimate. □ 

Remark 6.9. Proposition [6i8] can itself be seen as an instance of mod-Poisson 
convergence, for the cycle count of randomly, uniformly, chosen permuta- 
tions in Gd without small cycles. 

Precisely, let S^^'' denote the set of fi G with l'^ {a) > b. We then find 
first (by putting u = in Proposition 16. 8p that 



-— - ~d,6^+oo exp - > - 



provided b is restricted by 6 <C d^^^ ^ with e > arbitrarily small. Then, 
for arbitrary li S R and b similarly restricted, we find that 

1 ^ 1 

_ ^ exp((l - e-) -) E(e--('^^)), 



locally uniformly. Thus the mod-Poisson convergence (j4.8p for w{a(i) implies 
mod-Poisson convergence for the cycle count restricted to S^^'' as long as 
b <C ci^/^~^, with limiting function l/r(e*'*) and parameters 

logd-^ - ~log|. 



It may be that the restriction of b with respect to d could be relaxed. 
However, in the opposite direction, note that for 6 = d — 1, the number of 
d-cycles in S^, i.e., |6^'^ ^^|, is {d— 1)!, so the ratio is 1/d which is obviously 
not asymptotic with e'^'^ /{d — 1). 
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Remark 6.10. We come back to the asymptotic formula ()6.5p . to explain 
how it follows from Theorem 16.11 in the sharper form 

n (i-^)=»p(-E7)(i+o(-i,))' «=-^«> 



Namely, it is very easy to derive this asymptotic up to some constant: 

where 7^ is given by the awkward, yet absolutely convergent, expression 

II II m ^ 

From this, the flow of the proof leads to the mod-Poisson limit (16. ip . with 
an additional factor exp(— 7^6*") in the limit. But for u = 0, both sides 
of (|6.ip are equal to 1, so we must have exp(7g) = 1 for all q. (This is 
another interesting example of the information coming from mod-Poisson 
convergence, which is invisible at the level of the normal limit; note in 
particular that this is really a manifestation of the random permutations.) 

7. Final comments and questions 

Many natural questions arise out of this paper. The most obvious concern 
the general notion of mod-Poisson convergence, and its probabilistic signifi- 
cance and relation with other types of convergence and measures of approx- 
imation (and similarly for mod-Gaussian behavior). Already from [llj, it 
is clear that mod-Poisson convergence should be a very general fact in the 
setting of "logarithmic combinatorial structures" , as discussed in [1] . 

In the direction suggested by the Erdos-Kac Theorem, there is a very 
abundant literature concerning generalizations to additive functions and 
beyond (see, e.g., the discussion at the end of [9j), and again it would 
be interesting to know which of those Central Limit Theorems extend to 
mod-Poisson convergence, and maybe even more so, to know which don't. 

In the direction of pursuing the analogy with distribution of L-functions, 
the first thing to do might be to construct a proof of the mod-Poisson Erdos- 
Kac Theorem for integers which parallels the one of the previous section. 
This does not seem out of the question, but our current attempts suffer from 
the fact that the associations of permutations in "©logAr" to integers n ^ N 
that we have considered are ad-hoc (though potentially useful), and do not 
carry the flavor of a generalization of the Frobenius. It is then difficult 
to envision a further natural analogue of a unitary matrix associated, say, 
with C(l/2 + it). One can suggest a "made up" matrix Ut obtained by 
taking the zeros of C,{s) close to t, and wrapping them around the unit 
circle after proper rescaling, but this also lacks a good a priori definition - 
though this was studied by Coram and Diaconis [5] , who obtained extremely 
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good numerical agreement; this is also close to the "hybrid" model for the 
Riemann zeta function of Gonek, Hughes and Keating [7j. 

One may hope for more success in the case of finite fields in trying to 
understand (for instance) families of L-functions of algebraic curves in the 
limit of large genus, since the definition of a random matrix from Frobenius 
does not cause problem there (though recall it is really a conjugacy class). 
However, although we have Deligne's Equidistribution Theorem in the "ver- 
tical" direction q — > +00, and its proof is highly effective, it is not clear what 
a suitable analogue of the quantitative "diagonal" equidistribution (16. 6p in 
Lemma [6.6l should be. More precisely, what condition should replace the re- 
striction to polynomials without small irreducible factors? We do not have 
clear answers at the moment, but we hope to make progress in later work. 

Finally, it should be clear that analogues of mod-Gaussian and mod- 
Poisson convergence exist, involving other families of probability distribu- 
tions. Some cases related to discrete variables are discussed in ^ §5], and 
one may also define "mod-stable" convergence in an obvious way (though 
we do not have interesting examples of these to suggest at the moment). 
It may be interesting to investigate links between these various definitions; 
the last part of Proposition 12.41 suggests that there should exist interesting 
relations. 
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